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REDUCTION OF CM ELLIPTIC CURVES AND MODULAR 
FUNCTION CONGRUENCES 

NOAM ELKIES, KEN ONO AND TONGHAI YANG 


1. Introduction and Statement oe Results 
Let j{z) be the modular function for SL 2 (Z) defined by 


“ ^l + 240I:”,E.|„«-V 


= q ^ + 744 + 196884g + 


where q = The coefficients c{n) possess some striking properties. By “Mon¬ 

strous Moonshine”, these integers occur as degrees of a special graded representation 
of the Monster group, and they satisfy some classical Ramanujan-type congruences. 
In particular, Lehner proved Ci that if p < 7 is prime and m is a positive integer, 
then for every n > 1 we have 


c{p^n) = < 


'o 

(mod 2^™'+®) 

if p 

0 

/ 

(mod 

ifp 

0 

(mod 5™'"''^) 

if p 

.0 

(mod 7™) 

if p 


Modulo 11, it also turns out that c(lln) = 0 (mod 11) for every positive integer n. 
As usual, if U (p) denotes the formal power series operator 


( 1 . 1 ) 


a(n)g”j U(p) := Y aW?'", 


then these congruences imply that 

j{z) I U{p) = 744 (mod p) 

for every prime p < 11. It is natural to ask whether such congruences hold for any 
primes p > 13. 
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Remark. Since the Hecke operator T{p) acts like U{p) on spaces of holomorphic integer 
weight modular forms modulo p, this problem is somewhat analogous to that of 
determining whether there are inhnitely many non-ordinary primes for the generic 
integer weight newform without complex multiplication. Apart from those newforms 
associated to modular elliptic curves, for which the existence of inhnitely many non¬ 
ordinary primes was shown in jElj, little is known. 

Serre showed jHil that the answer to this question for j{z) is negative, an ob¬ 
servation which has been generalized by the second author and Ahlgren [AOj . In 
particular, if F{x) G Z[a;] is a polynomial of degree m > 1 and p > 12m-|-1 is a prime 
which does not divide the leading coefficient of F{x), then (see Corollary 5 of |A( )j ) 

F{j{z)) \U{p)^ a(0) (mod p), 

where a(0) is the constant term in the Fourier expansion of F{j{z)). 

Remark. In the cuspidal case where {m,p) = (1,13), it is interesting to note that (for 
example, see Section (6.16) or [Sij) 

{j{z) - 744) I 7/(13) = -A(; 2 ) (mod 13), 
where A(^) = ~ is the usual Delta-function. 

Here we investigate the more general question concerning the existence of congru¬ 
ences of the form 

(1.2) F{j{z)) \ U{p) = Gp{j{z)) (modp), 

where Gp{x) G Z[x]. The result quoted above implies that congruences of the form 
(O do not hold for any primes p > 12m -|- 1. This follows from the simple fact 
that the only polynomials in j{z) whose Fourier expansions do not contain negative 
powers of q are constant. 

In Section |21 we give a general criterion (Theorem 12. 3 j) that proves such congru¬ 
ences, and we apply it to Hilbert class polynomials. For a discriminant —D < 0, 
let Hd{x) G Z[a:] be the associated Hilbert class polynomial. More precisely, T-Cd{x) 
is the polynomial of degree h{—D) whose roots are the singular moduli of discrimi¬ 
nant —D, where h{—D) is the class number of the ring of integers Od of Q{y/—D). 
By the theory of complex multiplication, these singular moduli are the j-invariants 
of those elliptic curves that have complex multiplication hy Od-, the ring of integers 
ofQ(v/^). 

Although there are no congruences of the form (O for F{x) = Hd{x) involving 
primes p > 12h[—D) + l, we show that such congruences are quite common for smaller 
primes, as Lehner demonstrated for F[x) = Tizi.x) = x. 
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Theorem 1.1. Suppose that —D < 0 is a fundamental discriminant, and that integers 
Coin) are defined by 

OO 

nnUiz)) = CD{n)q'^. 

n=—h{—D) 

(1) If p < 11 is prime and h{—D) < p, then 

'HoUi.z)) I U{p) =czj(0) (modp). 

(2) For every prime p there is a non-positive integer Np such that 

T-CD{j{z)) I U{p) = Gp{j{z)) (mod p), 

for some Gp{x) G 1^[x], provided that —D < Np and (^) 7 ^ 1. This poly¬ 
nomial Gp{x) has degree < h{—D)/p. In particular, Gp{x) is constant if 
h{—D) < p, or more generally if Cni—pn) = 0 (mod p) for every integer 
n > 0. 

Proving Theorem II ■lh 2i depends heavily on the interplay between singnlar modnli 
and snpersingular j-invariants. For a prime p > 5, dehne the snpersingular loci Sp{x) 
and Sp{x) in Fp[x] by 

5'p(a:) := JJ {x-j{E)), 

E/¥p supersingular 

E /Fp supersingular 
i(i?)7{0,1728} 

These prodncts are over isomorphism classes of snpersingnlar elliptic curves. It is a 
classical fact (for example, see |Silj i that the degree of Sp{x) is [p/12j. 

The criterion for proving congruences of the form (El is stated in terms of the 
divisibility of F{x) by Sp{xfi in Fp[a;]. For Hilbert class polynomials, this criterion is 
quite natural since a classical theorem of Deuring asserts that the reduction of every 
discriminant —D singular modulus modulo p is a snpersingular j-invariant when p 
does not split in Q(-\/—H). 

Therefore, to prove Theorem II .11 we are forced to consider the surjectivity of Deur- 
ing’s reduction map of singular moduli onto snpersingular j-invariants, a question 
which is already of signihcant interest. Using classical facts about elliptic curves 
with CM and certain quaternion algebras, we reinterpret this problem in terms of 
the vanishing of Fourier coefficients of specihc weight 3/2 theta functions constructed 
by Gross. Then, using deep results of Duke and Iwaniec which bound coefficients of 
half-integral weight cusp forms, we obtain the following theorem. 
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Theorem 1.2. If p is an odd prime and t > 1, then there is a non-positive integer 
Np{t) such that Sp{xy \ in Fp[a;] for every fundamental discriminant —D < 

Np\t) for which (^) ^ 1. 

Three Remarks. 1. After this paper was submitted for publication, Bill Duke and 
the referee informed us of earlier work of P. Michel jMij . In this recent paper, Michel 
obtains equidistribution results which imply Theorem II .21 for discriminants for which 
p is inert. His proof, which is different from ours, is based on subconvexity bounds 
for L-functions. Our proof, which also includes the ramified cases, is based on non¬ 
trivial estimates of Fourier coefficients of half-integral weight cusp forms. Both proofs 
are somewhat related via Waldspurger’s formulas connecting values of L-functions to 
Fourier coefficients. 

2. Theorem o is ineffective due to the ineffectivity of Siegel’s lower bound for 
class numbers. It is possible to obtain effective results by employing various Riemann 
hypotheses (for example, see work ra by the second author and Soundararajan 
concerning Ramanujan’s ternary quadratic form), or by assuming the non-existence 
of Landau-Siegel zeros. 

3. Theorem o is closely related to the work of Gross and Zagier |GZj which 
provides the prime factorization of norms of differences of singular moduli in many 
cases. 

In Section |21 we use a result of Koike (arising in his study of “p-adic rigidity of 
j{zyy to prove Theorem 12.31 In Section |21 we recall preliminary facts regarding 
endomorphism rings of elliptic curves with complex multiplication and quaternion 
algebras, and we prove Theorem 11.21 nsiuEf facts about weight 3/2 Eisenstein series 
combined with the Duke-Iwaniec bounds for coefficients of weight 3/2 cusp forms. 
Then we combine these results with Theorem 12.31 to prove Theorem ll.il In Section |3] 
we conclude with some remarks on numerical calculations related to Theorems irn 
and 11.21 

2. A CONGRUENCE CRITERION AND SUPERSINGULAR j-INVARIANTS 

Here we give a simple criterion which implies congruences of the form (II.2j) . This 
criterion is a simple generalization of Theorem 2 of jAOj . The following result of 
Koike (see Proposition 1 of jKoij h which is a special case of work of Dwork and 
Deligne |Dwj . describes the Fourier expansion of j{pz) (mod p^) in terms of j{z) and 
the collection of supersingular j-invariants. Since clearly j{pz) = j{zy mod p, we can 
describe the Fourier expansion of j{pz) (mod p^) via the reduction modulo p of the 
expansion of {j{pz) — j{zy)/p. 

Theorem 2.1. For each prime p there is a rational function 5p{x) = Np{x)/Sp{x), 
with Np G Fp[a:], such that 

(2.1) j{pz) = j{zy + p6p{j{z)) (mod p^). 


REDUCTION OF CM ELLIPTIC CURVES AND MODULAR FUNCTION CONGRUENCES 5 


Corollary 2 . 2 . For all F G 7a[x] we have 

(2.2) F{j{pz)) = F{j{zy) + pF'{j{zY)6p{j{z)) (mod /), 
where F' denotes the derivative of F. 

Proof. By linearity it suffices to prove this for F = (/c = 0,1, 2,...). The case fc = 0 

is trivial; fc = 1 is Theorem o and for A; > 1 we may either raise the congruence in 
Theorem ED to the power k and reduce mod or argue by induction from the case 
k — 1 and the same congruence, obtaining 

(2.3) ]{pzf = j{zY^ + (mod p^) 

as claimed. □ 

We can now deduce our congruence criterion using little more than the theory of 
Hecke operators. 

Theorem 2 . 3 . Let F{x) G Jfx] he a monic polynomial of degree m, and let 

OO 

a{n)q^. 

n=—m 

(1) If p is prime and Sp{xY divides F{x) in Fp[x], then 

F{j{z)) I U{p) = Gp{j{z)) (mod p), 

for some Gp{x) G Z[x] with degree < m/p. 

(2) If p < 11 is prime and m < p, then 

F{j{z)) I U{p) = a(0) (mod p), 

where a(0) is the constant term in the Fourier expansion of F{j{z)). 

Proof. For (1), let 

CX) 

F{j{z)) = Y 

n=—oo 

Denote by To(p) the operator pT{p), that is, p times the usual weight zero pth Hecke 
operator. Then we have 

OO 

(2.4) pF{j{z)) I U{p) = F{j{z)) I To{p) - F{j{pz)) = P Y 

n=—oo 

The modular function F{j{z)) \ Tq{p) is in Z[j(z)] since it has integer Fourier co¬ 
efficients and is holomorphic on H. Since Sp{xY\F{x) we have Sp{x)\F'{x), whence 
also Sp{x)\F'(x^). By Corollary 12.21 it follows that F{j{pz)) (mod p^) is congruent 
modulo p^ to an integer polynomial in i{z), namely F{i{z)y -|-pF'(j(z)^)hp(j(z)). 
Thus (I2.4|l yields the desired congruence modulo p between F{j{z)) \ U{p) and a 
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polynomial in j{z). Moreover, this polynomial mnst have degree < m/p becanse 
F{j{z)) I U{p) has valnation > —m/p at the cnsp. 

For (2), we observe that the condition Sp{x)‘^\F{x) of (1) is vacnons for p < 11, 
becanse Sp{x) = 1 for those p. Thus F{j{z)) \ U{p) is always congruent mod p to a 
polynomial in j of degree at most m/p. In particular, if m < p then this polynomial 
reduces to a constant, which must equal a(0) by the dehnition of U{p). 

□ 


3. Gross’ theta functions and the proofs of Theorems 11.11 and 11.21 

Throughout, p shall denote a prime. We begin by recalling certain facts about 
elliptic curves with complex multiplication (for example, see Chapter II of jSi2j b Let 
B be the unique quaternion algebra over Q ramihed exactly at p and cx). For x G S, 
let Q{x) = XX = —x^, the reduced norm of x; the map Q : B ^ Qis a. quadratic form 
on B, which is positive-dehnite because B is ramihed at cxo. Fix a maximal order 
R G B. Then Q takes integer values on R, and since B is ramihed at p, the subset 

71 := {x e R \ p|Q(x)} 

of i? is a two-sided ideal with R/tt a hnite held of p^ elements and = pR. 

Let K = Q{\/—D) be an imaginary quadratic held with ring of integers Oo- More 
generally, for a positive integer m congruent to 0 or 3 mod 4, let 

Om = Z + y/-m)Z, 

the order of discriminant —m in Q{y/—m). An optimal embedding of Om into R is 
an embedding in i : Q(\/—m) B for which i~^{R) = Om- Such an embedding is 
determined by the image of \/—m in 

Id = {x G 5 I tr X = 0}, 

a 3-dimensional subspace of B. This image must be an element of norm m in the 
lattice 

L ■=VG{Z + 2R) 

in V, and conversely every v G L of norm m comes from an embedding Q(\/—m) ^ B, 
which is optimal if and only if n is a primitive vector of L (that is, v ^ fL for any 
/>!)• 

Two optimal embeddings A, *2 are equivalent if they are conjugate to each other 
by a unit in i?; In other words, if there is m G R^ such that ii{x) = ui 2 {x)u~^ for all 
X G Om- Let h{Om, R) be the number of equivalence classes of optimal embeddings 
of Om into R. Using the connection between embeddings and lattice vectors. Gross 
proved |Grj that these numbers generate the theta series of L as follows: 
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Lemma 3.1. ( jGrj . Proposition 12.9) The theta function 

Gl{z) ;= ^e(Q(a;)) 

x£L 


is given by 

9l{z) = 1 + 5 ^ 

m>l 


where 




= ^R 

m=pD 


h{OD,R) 
u{d) ’ 


in which u{D) = \ifOf and = ffR^. Moreover, 6 l is a holomorphic modular 
form of weight 3/2 and level 4p. 


Remark. This Oi{z) is a modular form of half-integral weight in the sense of Shimura |Shj . 
Moreover, it lies in Kohnen’s plus-space lEEh]. 


Now recall that every maximal order of B is isomorphic with the endomorphism 
ring of some supersingular elliptic curve Eq over fc = Fp, say R = End(ii^o)) with the 
ideal vr comprising the inseparable endomorphisms of Eq. Let Cp be the completion 
of an algebraic closure of Qp. The residue held of the unramihed quadratic extension 
of Qp in Cp is a hnite held of elements; call it k. There is then a canonical map 
R ^ k, X ^ X, dehned as follows: any x E R = End(Eo) induces multiplication by 
X on the invariant diherentials of Eq. The kernel {x | T = 0} is our ideal tt, so this 
map X ^x identihes k with R/tt. 

Suppose that {E, i) is a CM elliptic curve over Cp with complex multiplication 
by (Pi). We may then choose a map 

i-.Oo^ End(E) 

which is normalized in the sense that any a E Od acts on the invariant diherentials 
of E by multiplication by a. (There are two choices of i, related by conjugation in 
Gal(/L/Q), and one of them is normalized.) If p is inert or ramihed in K, then a 
classical result of Deuring states that E := E mod p is a supersingular elliptic curve 
over k, and if E = Eq, then we obtain an optimal embedding 

f :Od = End(E) ^ End(E) = End(Eo) = R, 

and moreover the embedding is normalized: if a; = /(a) then x is the residue of a 
in k. Any embedding equivalent to a normalized one is again normalized, because 
conjugation by a unit in R^ commutes with our map x x. Since any two isomor¬ 
phisms E = Eq diher by multiplication by a unit in R^ , one sees that the equivalence 
class of / is uniquely determined by {E,i). Conversely, given a normalized optimal 
embedding / : Od —> R, Deuring’s lifting theorem (see |GZ1 Proposition 2.7]) asserts 
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that there is a CM elliptic curve {E, i), unique up to isomorphism, such that E = Eo, 
and its associated optimal embedding is normalized and equivalent to /. 

Note now that embeddings of Ok into R come in conjugate pairs where 

t{a) = i{a) = i{a). If p is inert in K, then in each pair {i,t} exactly one of i,t is 
normalized, whereas if p is ramihed in K then every embedding is normalized. We 
have thus proved the following lemma. 


Lemma 3.2. Let Jo be the set of j-invariants of CM elliptic curves with endomor¬ 
phism ring Od- If we set 

Jd{Eq) = {j e Jd\ 3 mod p = HEq)], 

then 

ifJD{Eo)=eh{OD,R), 

where R = End(L^o) e = 1/2 or 1 according as p is inert or ramified in K. 


Proof of Theorem M JA For every prime p, there are finitely many supersingular elliptic 
curves Eq over Fp. So to prove the theorem it suffices to show, for each supersingular 
curve Eq with R = End(Fo) and each positive integer t, that there is a non-positive 
integer Np(t) such that every fundamental discriminant —D < Np{t) with (^) ^ 1 
has the property that OTdx=j(Eo)(IlD{x) mod p) > t. By Lemmas 13.11 and EI21 this is 
equivalent to 


( 3 . 1 ) aoiD) > -f- 

eu[d) 

This turns out to be a simple consequence of well-known deep results of Siegel jHIj, 
Duke jPuj . and Iwaniec (T^. Indeed, one has by 0 

(3.2) ^ l (^) = p _ Y ^Sen-{L) {z) + Cl{z), 

where Egen{L){z) is the Eisenstein series associated to the genus of the lattice L and 
Cl{z) is a cusp form of weight 3/2 and level 4p. Although Siegel’s result is not stated 
for forms of half-integral weight forms, the proof follows mutatis mutandis, with the 
constant 12/(p — 1) coming from Gross’ explicit calculation of these Eisenstein series. 
More precisely, he shows in jCrl (12.11)] (this is 2G in his notation, see also jKRYl 
§8]) that 

(3.3) = ^ + 2 Y 

m>0 


where Hp{m) is a slight modihcation of Kronecker-Hurwitz class number H{m) de- 
hned in |Crl (1.8)]. In particular, when —D is a fundamental discriminant, one has 


-D\\ h{-D) 
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where h{—D) is the ideal class number of K. 

By Siegel’s theorem m, one has 

Hp{D) > 

for every e > 0 when (^) 7 ^ 1. On the other hand, a theorem of Duke |Dnj . which 
extended earlier work of Iwaniec jlwj , implies that the coefficients of the cusp form 

Cl{z) = '^CL{m)q^ 

m>l 


satisfy 

|ci(D)|«Df+h 

Since 3/7 < 1 / 2 , we get 

+ cl{D) > 

This proves eu, and consequently completes the proof of the theorem. □ 


Proof of Thoorem M . 1i Theorem II .1! (1) follows immediately from Theorem 12.31 (1). 
Theorem 11.11 (2) follows from Theorem 12.31 (2) and Theorem Oby letting Np = 
Np{2). □ 


4. Concluding Remarks 

Numerical computations reveal many nearly uniform sets of examples of congru¬ 
ences of the form (d. Here we comment on those cases where 

UD{j{z)) I U{p) = cd( 0) (modp). 

In view of Theorem El it is natural to investigate those fundamental discriminants 
-D < 0 for which 

(4.1) p/6<h{—D)<p. 

The lower bound of this inequality is dictated by the fact that the degree of Sp{x) is 
[p/12j, and the upper bound is chosen so that the U{p) operator does not produce a 
Fourier expansion with negative powers of q. 

Computations reveal that if —239 < —D < 0 and p is an odd prime satisfying dUD 
for which 7 ^ 1, then Sp{xY divides 1-Ld{x) in Fp[x], which, by Theorem 12.31 in 
turn implies that 

HDiiiz)) I U{p) = cd( 0) (modp). 

This uniformity suggests that this phenomenon might hold in generality. However, 
this is not true; when —D = —239, we have 

n-2mU{z)) I c(79) = 44 + 2g + 62^^ + ... (mod 79), 
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although 79 is inert in Q(V—239) and h{—239) = 15. In this case Srgi^x) divides 

"^ 239 ( 2 ^) in lF 79 [a;], but the supersingular j-invariant j = —15 is a root of multiplicity 

only 1. This raises the following natural question. 

Question. If p is an odd prime, then define Qp by 

flp := fundamental | p/6 < h{—D) < p and ^^ ^ 

In general, what “proportion” of —D G flp have the property that 

HoUiz)) \U{p) = cd{0) (mod p)? 
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